Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Topic Labels

Authors

  • Alison Smith
  • Tak Yeon Lee
  • Forough Poursabzi-Sangdeh
  • Jordan L. Boyd-Graber
  • Niklas Elmqvist
  • Leah Findlater
Abstract

Probabilistic topic models are important tools for indexing, summarizing, and analyzing large document collections by their themes. However, promoting end-user understanding of topics remains an open research problem. We compare labels generated by users given four topic visualization techniques (word lists, word lists with bars, word clouds, and network graphs) against each other and against automatically generated labels. Our basis of comparison is participant ratings of how well labels describe documents from the topic. Our study has two phases: a labeling phase where participants label visualized topics and a validation phase where different participants select which labels best describe the topics' documents. Although all visualizations produce similar quality labels, simple visualizations such as word lists allow participants to quickly understand topics, while complex visualizations take longer but expose multi-word expressions that simpler visualizations obscure. Automatic labels lag behind user-created labels, but our dataset of manually labeled topics highlights linguistic patterns (e.g., hypernyms, phrases) that can be used to improve automatic topic labeling algorithms.

Similar resources

Understanding and Using Patterns of Food Labeling Systems and their Determinants by Medical Students of Tabriz University of Medical Sciences, Iran

Background and Objectives: Increased public knowledge of the role of nutrition in preventing non-communicable diseases has urged people to select healthy foods. The aim of this study was to investigate levels of understanding and use of food labeling systems and their determinants by medical students of Tabriz University of Medical Sciences, Tabriz, Iran. Materials and Methods: In a c...

Evaluating Visual Preferences of Architects and People Toward Housing Facades, Using Multidimensional Scaling Analysis (MDS)

One of the most important issues to have occupied public opinion and the expert community in recent years is the qualitative and quantitative aspects of housing. Several challenges relate to this topic, ranging from construction, manufacturing, and planning to social, cultural, physical, and architectural design aspects. The thing that has a significant ...

Mining Adverse Events of Dietary Supplements from Product Labels by Topic Modeling

The adverse events of the dietary supplements should be subject to scrutiny due to their growing clinical application and consumption among U.S. adults. An effective method for mining and grouping the adverse events of the dietary supplements is to evaluate product labeling for the rapidly increasing number of new products available in the market. In this study, the adverse events information w...

Evaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset

The success of word representations (embeddings) learned from text has motivated analogous methods to learn representations of longer sequences of text such as sentences, a fundamental step for any task requiring some level of text understanding [13]. Sentence representation is a challenging task that has to consider aspects such as compositionality, phrase similarity, negation, etc. In order to...

Journal title:
  • TACL

Volume 5  Issue 

Pages  -

Publication date 2017